Auditory masking threshold estimation for broadband noise sources with application to speech enhancement

Authors

  • Ruhi Sarikaya
  • John H. L. Hansen
Abstract

This paper addresses issues encountered in the use of an Auditory Masking Threshold (AMT) for speech enhancement and proposes an algorithm to improve AMT estimation for broadband noise sources. We determined that AMT estimation is fairly accurate for low-frequency colored noise sources, so an AMT-based enhancement scheme can suppress audible noise to a greater extent there, but that the algorithm fails to converge to the clean-speech AMT for broadband communication channel noise. We propose a new AMT estimation scheme and incorporate the proposed algorithm into a previously developed enhancement framework [2]. We evaluate our algorithm on a set of sentences obtained from the standard TIMIT database for flat communications channel noise (FLN) and automobile highway noise (HWY) at 5 dB and 0 dB SNR levels, respectively. Evaluations were performed for 8 kHz and 16 kHz sampled speech, and performance is measured with both objective and subjective assessment methods. The results show that the new AMT codebook-based enhancement method is more effective than traditional AMT methods. They also show that traditional AMT methods may be less effective for reduced-bandwidth speech (4 kHz) or broadband interference, but that alternative AMT estimation methods can help improve convergence properties.
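The abstract states that test material was degraded at 5 dB and 0 dB SNR. As a rough illustration of what that means in practice, the sketch below scales a noise signal so that the speech-to-noise power ratio matches a target SNR before adding it to the speech. This is not the authors' procedure: the function names (`mix_at_snr`, `measured_snr_db`), the use of whole-signal average power, and the synthetic sine and Gaussian signals standing in for TIMIT speech and recorded noise are all assumptions made for the example.

```python
import math
import random


def mean_power(x):
    """Average power (mean of squared samples) of a sequence."""
    return sum(v * v for v in x) / len(x)


def mix_at_snr(speech, noise, target_snr_db):
    """Scale `noise` so speech power / noise power equals the target SNR (dB),
    then add it to `speech`. Returns (noisy signal, scaled noise)."""
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    p_speech = mean_power(speech)
    p_noise = mean_power(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("speech and noise must both have non-zero power")
    # Required noise power is p_speech / 10^(SNR/10); gain is the amplitude ratio.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (target_snr_db / 10)))
    scaled = [gain * n for n in noise]
    noisy = [s + n for s, n in zip(speech, scaled)]
    return noisy, scaled


def measured_snr_db(speech, noise):
    """SNR in dB computed from whole-signal average powers."""
    return 10 * math.log10(mean_power(speech) / mean_power(noise))


# Hypothetical stand-ins: one second at 8 kHz of a tone and of white noise.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

noisy_5db, scaled_5db = mix_at_snr(speech, noise, 5.0)
noisy_0db, scaled_0db = mix_at_snr(speech, noise, 0.0)
```

At 0 dB the scaled noise carries the same average power as the speech, which is why it is the harder of the two conditions. Evaluations in the literature often compute SNR over speech-active frames only; the whole-signal version here is the simplest variant.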


Similar resources

A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...


Perceptual Speech Enhancement Using a Hilbert Transform Based Time-Frequency Representation of Speech

A new Time-Frequency (TF) representation of the speech signal is introduced and used for speech enhancement. Both the TF representation and the speech enhancement algorithm are based on perceptual properties of the human auditory system, in which the concept of band analysis is exploited. The TF representation is obtained by means of an analytic decomposition of the speech signal in the hearing Critical Bands (CB) ...


Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters

A speech enhancement technique based on the temporal masking properties of the human auditory system is presented. The noisy signal is divided into a number of sub-bands with fractional Bark accuracy, and the sub-band signals are individually and adaptively weighted in the time domain according to a short-term estimate of the temporal-masking-threshold-to-noise ratio in each sub-band. Objective measure...


A Time-Frequency Adaptation Based on Quantum Neural Networks for Speech Enhancement

In this paper, we propose a novel wavelet coefficient threshold (WCT) that depends on both time and frequency information to provide robustness in non-stationary and correlated noisy environments. A perceptual wavelet filter-bank (PWFB) is first used to decompose the noisy speech signal into critical bands according to the psychoacoustic model of the human auditory system. The estim...


A Perceptual Approach to Reduce Musical Noise Using Critical Bands Tonality Coefficients and Masking Thresholds

Traditional noise reduction techniques have the drawback of generating an annoying musical noise. A new scheme for speech enhancement in high-noise environments is developed by considering the masking characteristics of the human auditory system. The new scheme considers the masking thresholds of both the noisy speech and the denoised speech to detect musical noise components. To make them inaudible, they are set...




Publication date: 1999